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ABSTRACT: We give a probabilistic analysis for the randomized game tree 
evaluation algorithm of Snir. We first show that there exists an input such that the 
running time, measured as the number of external nodes read by the algorithm, on 
that input is maximal in stochastic order among all possible inputs. For this worst 
case input we identify the exact expectation of the number of external nodes read 
by the algorithm, give the asymptotic order of the variance including the leading 
constant, provide a limit law for an appropriate normalization as well as a tail 
bound estimating large deviations. Our tail bound improves upon the exponent of 
an earlier bound due to Karp and Zhang, where subgaussian tails were shown based 
on an approach using multitype branching processes and Azuma's inequality. Our 
approach rests on a direct, inductive estimate of the moment generating function. 

1 Introduction 

In this note we analyze the performance of the randomized algorithm to evaluate 
Boolean decision trees proposed by Snir (1985). Given is a complete binary tree of 
height 2k, k > 1, where the root (at depth 0) is labeled A as are all internal nodes 
with even depth, all internal nodes with odd depth are labeled V. The n = 2 2k 
external nodes are labeled either or 1 and the objective is to calculate the value 
of the root. For each node its value is given as the value of the operation labeled at 
that node applied to the values of its children. The cost for evaluating the Boolean 
decision tree is measured as the number of external nodes read by the algorithm. 

Snir proposed and analyzed the following randomized algorithm to evaluate 
a Boolean decision tree: At each node one chooses randomly (with probability 
1/2) one of its children and calculates its value recursively. If the result allows 
to identify the value of the node (that is a for a A-labeled node and a 1 for a 
V-labeled node, respectively) one is done, otherwise also the other child's value 
has to be calculated recursively in order to obtain the value of the node. Applying 
this to the root of the tree yields the value of the Boolean decision tree. 

The advantage of this algorithm over any deterministic algorithm is that 
for any input at the external nodes its expected cost is sublinear in n, whereas 
any deterministic algorithm has linear worst case cost. More precisely, Saks and 
Wigderson (1986) obtained that the maximum expected cost is of the order S(n Q ) 
with a = log 2 ((l 4- \/33)/4) = 0.753 and showed that this is also a lower bound 
on the maximum expected cost for any other randomized algorithm to evaluate a 
Boolean decision tree; see also Motwani and Raghavan (1995, Chapter 2) for an 
account on this subject. Further analysis was given by Karp and Zhang (1995). 
For certain regular inputs at the external nodes the cost of the algorithm can be 
represented via 2-type Galton- Watson processes. Karp and Zhang showed that 
the normalized cost has subgaussian tails. That argument was based on Azuma's 
inequality. 

We denote the input of 0's and l's at the external nodes as a vector v £ {0, 1}" 
and the number of external nodes read by the algorithm on input v by C(v ). We 
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will see subsequently that for particular v* € {0, 1}™ not only the expectation of 
the cost of the algorithm is maximized, i.e., EC(t)*) = max^^i}™ EC(w), but also 
that C(v*) is maximal in stochastic order, C(v) -< C(v*) for all v e {0, 1}™. Here, 
X <Y for random variables X, Y denotes that the corresponding distribution 
functions Fx,Fy satisfy Fx{x) > Fy{x) for all x e R, or, equivalently, that 
there are realizations X',Y' of the distributions £(X),C(Y) of X,Y on a, joint 
probability space such that we pointwise have X' <Y' . 

From this perspective it is reasonable to consider C(v*) as the worst case 
complexity of the randomized algorithm and to analyze its asymptotic probabilistic 
behavior. Our results for the exact mean of C(v*), the asymptotic growth of its 
variance including the evaluation of the leading constant, a limit law for C(v*) 
after normalization as k — > oo together with an explicit tail estimate are based on 
a recursive description of the problem. Since v* is a regular input in the sense of 
Karp and Zhang, also their 2-type Galton- Watson approach applies. 

Our main finding is an improvement of the tail bound exp(— const t 2 ) for 
t > 0, to exp(— const t K ) 1 with 1 < k < 1/(1 — a) = 4.06, see Theorem 3.6. 
This is based on a direct, inductive estimate of the moment generating function. 
Our approach is also applicable to any regular input as well as to other related 
problems. 

The paper is organized as follows: In section 2 we explain, how a worst case 
input v* is obtained. Section 3 contains the statements of the results. In sections 4 
and 5 the 2-type branching process of Karp and Zhang (1995) is recalled and the 
recursive description of the quantities, that our analysis is based on, is introduced. 
Section 6 contains the proofs of our results and section 7 has extensions to m-ary 
Boolean decision trees. 

2 Worst case input 

In this section we explain how a worst case input v* is constructed. We first have 
a look at the case k = 1 and v e {0, l} 4 such that the decision tree is evaluated 
to 1 at the root. Clearly both children of the root have to lead to an evaluation 
of 1. Now each pair of external nodes attached to the children needs to have at 
least one value 1. Note that the algorithm reads in both pairs of external nodes 
until it finds the first one. Hence there will in total be read two l's no matter how 
v e {0, l} 4 is drawn among the choices that lead to an evaluation of 1 for the 
decision tree. Clearly, to maximize the number of 0's being read we choose in each 
pair of external nodes one and one 1. Then both 0's are being read independently 
with probability 1/2. Hence, v\ — (0, 1,0, 1) stochastically maximizes C(v) for all 
v £ {0, l} 4 such that the decision tree evaluates 1, see Figure 1. 

Analogously look at the case k = 1 and v € {0, l} 4 such that the decision 
tree is evaluated to 0. Clearly, one child of the root has to have the value 0, whose 
external nodes attached need to have both values 0. If we choose also value for 
the other child of the root, we are lead to v = (0, 0, 0, 0), and the algorithm reads 
exactly 2 external nodes with values both 0. Therefore, to stochastically maximize 
C(v) we choose the second child of the root with value 1 and again its external 
nodes attached with values and 1. Then, v — (0, 0, 0, 1) stochastically maximizes 
C(v) for all v £ {0, l} 4 for which the decision tree evaluates to 0, see Figure 1. 

Since we have C(vo) di C(vi), it follows that v* — (0, 1, 0, 1) is a choice with 
C(v) d C(v*) for all v E {0, l} 4 . For general k > 2 a corresponding v* = v*(k) 
can recursively be constructed from v*(k — 1) as follows: Each component in 
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v*(k — 1) is replaced by the block 0, 0, 0, 1, whereas each 1 is replaced by the block 
0, 1, 0, 1. For example, for k = 3, this yields 

v* = (0,0,0,1,0,0,0,1,0,0,0,1,0,1,0,1, 
0,0,0,1,0,1,0,1,0,0,0,1,0,1,0,1, 
0,0,0,1,0,0,0,1,0,0,0,1,0,1,0,1, 
0,0,0,1,0,1,0,1,0,0,0,1,0,1,0,1). 

In Proposition 3.1 we show that this construction yields a v* with C(v) ^ C(v*) 
for all v e {0,1}™ and k > 1. 





Figure 1: Shown are decision trees for k = 1 evaluating at the root to 1 and 0, 
respectively, together with a choice for the external nodes that stochastically max- 
imizes the number of external nodes read by the algorithm. 



If we would only want to stochastically maximize the cost over all v G 
Ro(n) C {0,1}™ that evaluate to a at the root, the same recursive construc- 
tion of replacing digits by corresponding blocks, starting with vq = (0,0,0,1), 
yields a» t e Ro(n) such that C(v) -< C(v+) for all v G Ro(n). 



3 Results 

We assume that we have n = 2 2k with k > 1 and denote by v* G {0, 1}" an input 
as constructed in section 2. 

Proposition 3.1 For v* G {0, 1}™ as defined in section 2 we have C(v) ^ C(v*) 
for all v G {0, 1}™. 

The stochastic worst case behavior C(v*) of the randomized game tree evaluation 
algorithm has the following asymptotic properties: The subsequent theorems de- 
scribe the behavior of mean, variance, limit distribution, and large deviations of 
C(v*). For the mean we have: 

Theorem 3.2 The expectation of C(v*) is given by EC(ti*) = c\n a — c 2 n /3 7 with 

a = log 2 ^ , /? = log 2 , c 1 = - + ^-j=, C2 = ci-1. 

We denote for sequences (a^), (&&) by afe ~ bk asymptotic equivalence, i.e., a^/bk — > 
1 as k — > oo. Then we have for the variance of C(u*): 
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Theorem 3.3 The variance of C(v*) satisfies asymptotically Var C(v*) ~ dn 2a 
as k — > oo, where d = 0.0938. The constant d can also be given in closed form. 

For random variables X, Y we denote by X = Y equality in distribution, i.e., 
C(X) = C(Y). Then we have the following limit law for C(v*): 

Theorem 3.4 ForC(v*) we have after normalization convergence in distribution, 

> G, k — > oo, 



where the distribution of C is given as C{C) = C{G\) and C(G) = £(Gq,Gi) is 
characterized by E || G || 2 < oo, E G — (cq, ci), wii/i c = 1/2 + 5/(2v33), and 
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lOT'tt G' 1 ',.. .,G( 4 ), Si,S 2 independent with C(G^) = C(G), forr = 1,.. .,4, and 
£(Bi) = C(B 2 ) = 5(1/2). ffere, 5(1/2) denotes the Bernoulli^/ 2) distribution. 

For the estimate of large deviations we rely on Chernoff 's bounding technique. We 
need to follow a bivariate setting for the vector (G(u*), C(v*)) as introduced in 
section 5. The following bound on the moment generating function is obtained: 

Proposition 3.5 It exists a sequence (Yk)k>o — (ifc.Oi ^fc,i)fe>o of bivariate ran- 
dom variables with marginal distributions C((C(v*) — EC(t)*))/n a ), C((C(v+) — 
EC(Vi,))/n a ) such that for all q > l/a = 1.33 there is a K > with 

Eexp(s,y fc )<exp(/q S ||<0 (1) 

for all s £ M 2 and k > 0. An explicit value for K = K q is given in ^j). 

The bound on the moment generating function in the previous proposition implies 
a large deviation estimate via Chernoff bounds: 

Theorem 3.6 For all 1 < K < 1/(1 — a) = 4.06 there exists an L > such that 
for any t > and n = 2 2k 



( C(v*) -EC(v*) 
V n a 



> t) < exp(-Lt K ). 



(2) 



An explicit value for L is given in f3|). The same bound applies to the left tail. 

The approach of Karp and Zhang (1995) based on Azuma's inequality gives the 
tail bound exp(— L't 2 ) for an explicitly known L' . For n = 2 the prefactor L = L 2 
in Theorem 3.6 can also be evaluated and satisfies L 2 > 11L'. 



4 Karp and Zhang's 2-type branching process 

For the analysis of C(v*) note that whenever the algorithm has to evaluate the 
value of a node at a certain depth that yields a 1, according to the discussion of 
section 2, the algorithm has to evaluate the values of two nodes of depths two levels 
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below that each yield a 1 , and B3 + B4 nodes of depths two levels below that each 
yield a 0, cf. Figure 1. Here, B 3l Bi are independent Bernoulli B(l/2) distributed 
random variables. Analogously, when the algorithm has to evaluate the value of 
a node at a certain depth that yields 0, two levels below it has to evaluate B\ 
nodes yielding a 1 and 2 + B\B 2 nodes yielding a 0, where Bi,B 2 are independent 
5(1/2) distributed random variables. Here, the event {B\ = 1} corresponds to the 
algorithm first checking the right child of the node to be evaluated and {B 2 = 1} 
to first checking the left child of that child, cf. Figure 1. Since at each node the 
child being evaluated first is independently drawn from all other choices, this gives 
rise to the following 2- type Galton- Watson branching process. 

We have individuals of type and 1 where the population of the fc-th gen- 
eration corresponds to the number of nodes at depth 2k that are read by the 
algorithm. We consider processes starting either with an individual of type 1 or 
type and assume that the algorithm is applied to the worst case inputs v* and 
«*, respectively. Then we have the following offspring distributions: An individual 
of type 1 has an offspring of 2 individuals of type 1 and B3 + B4 individuals of 
type 0. An individual of type has an offspring of B\ individuals of type 1 and 
2 + B\B 2 individuals of type 0. We denote the number of individuals of type 

and 1 in generation k by {Vn\Wn^), when starting with an individual of type 
i = 0, 1, where n = 2 2k . Note that for v* , € {0, 1}™ we have the representations 

c{v*) ± v w + wKD, ck) i + w^. 

This is the approach of Karp and Zhang (1995) for regular inputs like v*, v*. Hence, 
part of the analysis of C(v*) can be reduced to the application of the theory of 
multitype branching processes; see for general reference Harris (1963) and Athreya 
and Ney (1972), and for a survey on the application of branching processes to tree 
structures and tree algorithms see Devroye (1998). 

However, we will also use a recursive description of the problem. This will be 
given in the next section and enables to use as well results from the probabilistic 
analysis of recursive algorithms by the contraction method. 



5 The recursive point of view 

It is convenient to work as well with a recursive description of the distributions 
£(C(v*)) and £(C(v*)). For this, we define the distributions of a bivariate random 
sequence (Z n ) = (Z nfi , -Zn,i) for all n = 2 2k , k > by Z\ = (1, 1) and, for k > 1, 



7 A 7(1) , 7(2) , 



B\B 2 
1-B 2 



71/4 





B x 



Br 




7 (4) 

J n/4' 



where Z 



(i) 

n/4> 



Z^) A , Bi,B 2 are independent, B\,B 2 are Bernoulli B{l/2) dis- 



tributed and jC(Z^J 4 ) = ■ ■ ■ = £(Z^J 4 ) = C{Z n / A ). It can directly be checked by 

induction that the marginals of Z n satisfy £(Z n fi) = £(C(u*)) and £(Z ni i) — 
£(C(v*)). Note that Z n $ and Z n ^\ become dependent, firstly, since we have cou- 
pled the offspring distributions using for the second component of Z n again B\ 
and 1 — B 2 instead of B 3 and B4, cf. section 4, and, secondly, since the first 
component of Z y n j 4 contributes to both components of Z n . Sequences satisfying 
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recursive equations as (Z n ) are being dealt with in a probabilistic framework, the 
so called contraction method; see Rosier (1991, 1992), Rachev and Riischendorf 
(1995), Rosier and Riischendorf (2001), and Ncininger and Riischendorf (2004). 

6 Proofs 

In this section we sketch the proofs of the results stated in section 3. 

Proof of Proposition 3.1: (Sketch) We denote by R (n), i?i(n) C {0, 1}™ the 
sets of vectors at the external nodes at depth 2k that yield an evaluation at the 
root of the decision tree of value and 1, respectively. From the discussion in 
section 2 we have 

C(v)<C(v+), veR (n), and C(v) r< C(v*), v e Ri(n). 

Hence, it remains to show that C(w*) ^ C(v*). This is shown by induction on 
k > 1. For k — 1 this can directly be checked. For the step k — l^k assume that 
we have C(v±(k— 1)) ^ C(v*(k— 1)). It suffices to find realizations of the quantities 

{V^\wi 1] ) and (y„ (0) ,wi 0) ) on a joint probability space with K (0) + W { n ] < 
VP + W { n ] almost surely, n = 2 2k . 

For this we use B,B', (vffi ,wffi) for i = 1, 2, j = 1, . . . , 4 being indepen- 
dent for each i = 0,1 and with B,B' Bernoulli 5(1/2) distributed, C(V^h j ) = 

C(V^\), £(wfy j ) = C(W ( ^) for i = l,2 and j = 1.....4. By the induction 

hypothesis we may assume that we have versions of these random variates with 

V §l J + W n/i ^ V nV + W n/i for j = 1, • • • , 4. With this coupling we define 

(K (1) , W [ a ] ) and (V4 (0) , Wi 0) ) according to the values of B, B': On {B = 1,B' = 0} 
we set 




and obtain V4 (0) + < + . On the remaining sets {B = 0,B' = 

0}, {B = 0,B' = 1}, and {B = 1,B' = 1} similar couplings of (V4 ( °\ wi° } ), 
(K (1) , Wi 1] ) can be defined with K (0) + W^ 0) < + wi 1] . ■ 

Proof of Theorem 3.2: (Sketch) Assume that a generation has {wq,w\) indi- 
viduals of type and 1. Then, by the definition on the offspring distribution in 
section 4, the expected number of individuals in the subsequent generation is given 
by 

«(»»), „:= 



9/4 1 
1/2 2 
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Since C(v*) = C(v*(k)) is the sum of the individuals at generation k for the 
process started with an individual of type 1 we obtain 



EC(v*) = (l,l)M fe Q 



The matrix M has the eigenvalues Ai 
its fc-th power can be evaluated to 



M k = 



2v/33 



(17 + V33)/8 and A 2 = (17- V33)/8 and 



(V33 + 1)A{ + (V33 - 1)A§ _ 8(A{ - A|) 

4(Af-A§) (V33-l)Af + (V33 + l)A| 



From this, EC(v*) and various constants needed subsequently can be read off. 
Note, that A* = n a with a given in Theorem 3.2 and n = 2 2fe . ■ 

Before proving Theorem 3.3 it is convenient to first prove Theorem 3.4. 

Proof of Theorem 3.4: (Sketch) The 2-type branching process defined in section 
4 is supercritical, nonsingular, and positive regular. Hence, a theorem of Harris 
(1963) implies that 



M 1} . 



Y 



almost surely, as k — > oo, where Y is a nonnegative random variable and (^1,^2) 
a deterministic vector that could also be further specified. Thus we obtain 



C{v*) 



C 



n" 



in distribution, as k — > 00, with C{C) = C{{u\ + V2)Y). 

On the other hand the recursive formulation of section 5 leads after the 
normalization X n := Z n /n a to 



for k > 1, where A\ = A2 = (l/4 Q )/2, with the 2x2 identity matrix I2, and 



1 



where X 



(i) 



A3 = — 



4' 

K4) 



BiB 2 
I-B2 



A 4 = — 



Si 
Si 



(3) 



V4' ' ' ' '" v n/4' ^I'^s are independent with £(^„/ 4 ) = £(X n/4 ) for r 
1, . . . ,4 and C(Bi) = £(-62) = 5(1/2). It follows from the contraction method 
that X„ converges weakly and with all mixed second moments to some G, that 
can be characterized as in Theorem 3.4. For details, how to apply the contraction 
method, see Theorem 4.1 in Neininger (2001). Thus, we have C(v*)/n a — > G\ in 
distribution. ■ 
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Proof of Theorem 3.3: (Sketch) As shown in the proof of Theorem 3.4 we 
have the convergence X n = Z n /n a — > G for all mixed second moments. This, 
in particular, implies VarX„ i — > VarGi. The variances of G\ can be obtained 
from the distributional identity for G stated in Theorem 3.4. Then we obtain 
VarG(u*) = Vax(n a X n<1 ) ~ dn 2a with d = VarGi. ■ 



Proof of Proposition 3.5: For Y n = (\/n a )(Z n — KZ n ) we have marginals 
C(Y nA ) = C((C(v*) - EC(v*))/n a ) and C(Y nfi ) = £((<?(«*) - EC(«*))/n ). The 
distributional recurrence for Z„ from section 5 implies the relation 

4 

— ^ t A r Y n ^ + 6 n , fc > 1, 
r=l 

with y « , . . . , Y^\ , B x , B a independent, = £(F n/4 ),for r = 1, . . . , 4, C{B l ) 

C(B 2 ) = 5(1/2) and 6 n = (l/n a )(4"^ =1 (i r E2„ /4 ) -E2„). The matrices A r 
are given in lf3"jl. 

We prove the assertion by induction on k. For k = we have Yq = 0, thus 
the assertion is true. Assume the assertion is true for some n/4 = 2 2 ' fe_1 ^. Then, 
conditioning on (A±, . . . ,A 4 ,b n ), denoting the distribution of this vector by a n , 
and using the induction hypothesis, we obtain 

Eexp(s,T„) = / exp(s, /3 n ) E exp(s, a r Y n / 4 )da n (ai, . . . , a 4 , (3 n ) 

J r=l 

< / cxp(s,/3„) Y\_cxp(K\\a^s\\ q )da n (a 1 ,... 1 a 4 ,P n ) 



< J exp ^(8,pn) + K\\s\\ q J2\\ a r\\ q o^j da n (ai,...,<H,Pn) 

= E exp«s, b n ) + K\\s\\ q U) eMK\\s\\ q ), 

with U := E 4 =i (HA-Hgp) - 1 = 4- Q »(2 + BiBa + (1 - B 2 ) + B t ) - 1 and ||A|| op = 
su P||a:||=i ll^ll f° r matrices A. Hence, the proof is completed by showing 

supE exp((s, b n ) + K\\s\\ q U) < 1, 
fe>i 

for some appropriate K > 0. We denote £ := — ess sup J7 = 1 — 4 1 ~ a<? , thus q> 1/a 
implies £ > 0. 

Small \\s\\: First we consider small ||s|| with ||s|| < c/sup fc>1 ||6 n ||2,oo for some 
c > , where 1 1 b n \ | 2 , 00 : = 1 1 1 1 b n \ \ \\oo, the inner norm being the Euclidean norm. Note 
that throughout we have n — n(k) = 2 2k . For these small ||s|| we have 

E exp(((s,6„) +K\\s\\ q U) < exp(-K \\s\\ q ()E exp(s,6„) 
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and, with E (s, b n ) = 0, 

(s,b n ) k 



Eexp(s,6„) = E 



1 + (*, b n ) + ^ 



k\ 

k=2 



l + E(s,b n ) 2 J2 



(s,b„) 



k-2 



k\ 

k=2 



< 1 + || S || 2 E||M 2 ^ 



00 c k-2 



k=2 

= l + j|, s ||2E||M 2£ — 
Using exp(-if||s||^) < 1/(1 + K||s|| 9 and with *(c) = (e c -l-c)/c 2 we obtain 

Ew ,,, A , t ,,, w < iaatfg . 

Hence, we have to choose if with 

*T> l|a||2 ~ g * (c) supEl|b„|| 2 . 
? fe>i 

With < c/sup fc>1 ||6 n ||2,oo a possible choice is 

= SUPfc^EUfcnll 2 W,(c) 
SUPfe>i ||6n||2^ £ 

with *,(c) = (e c - 1 - c)/c«. 

Large ||s||: For general s £ R 2 we have 

(s,b n ) + K\WU < \\s\\\\b n \\ WaW'KZ < || S ||||& n ||2,oo - INI 9 ^, 
and this is less than zero if 



■ „g_i > sup^ ||6„|| 2 ,oo _ sup fc >! ||6 n 



|3— q 



Ki SUp^E^pi^c)' 

If ||s|| satisfies the latter inequality we call it large. Thus, for large ||s|| we have 
sup fe > 1 Ecxp((s,6„)+^|| S ||9C/) < 1. 

In order to overlap the regions for small and large ||s|| we need 

* l( c) > SUPfc>1 IIM '~ 



sup fe > 1 E||6„||2- 

The right hand side of the latter display can be evaluated explicitly for our problem 
and equals 104/77. Thus, this inequality is true for, e.g., c = 1.53. Hence, with the 
explicit value 

9 sup fe>1 ||Ml2^ 1-539(1 -4i-«°) [> 
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the proof is completed. ■ 



Proof of Theorem 3.6: By Chernoff's bounding technique we have, for u > 
and with Proposition 3.5, 

C(v*)-EC(v*) > \ = P(exp(u y n;l)>exp(ut))) 



< E exp(uY n i — ut) 

= E exp(((0, u), Y n ) — ut) 

< exp(K q u q — ut), 

for all q, K q as in Proposition 3.5 and |@J. Minimizing over u > we obtain the 
bound 

P ( n a > * ) ^ exp(-if K ), 

for 1 < k < 1/(1 - a), with 



K/(K — 1) 

and K K /r K _i\ given in Q. This completes the tail bound 



L = L K = K 1 7« i (5) 

K K/{K — 1) \ ' 



7 m-ary Boolean decision trees 

The analysis can be carried over to the case of m-ary Boolean decision trees. The 
algorithm visits randomly chosen children and evaluates recursively their values 
until the value of the root can be identified, the remaining children are discarded 
afterwards. A worst case input v* £ {0, 1}™ with n = m 2k can be constructed 
similarly. Then we have similar results for C(v*): 

Theorem 7.1 For the worst case complexity C(v*) of evaluating an m-ary Boolean 
decision tree we have the following asymptotics: 

ECO*) = c[ m) n a ™ +c ( ™ ) n l3 "% 

VarCV) ~ d m n 2a ™, 
gK) r 

/C(0-EC(0 



>tj < cxp(-L^ m h K ), t>0, 



with constants , a m> (3 m , d m , > 0, c^ € K, and 1 < k < k,„ = 

1/(1 -a m ). 

Numerical values for a m , c? m and K m are listed in Table 1 . The distribution of C m is 
given as C(C m ) = C(G\) and C(G) — C(Gq, G\) is characterized by E ||G|| 2 < oo, 
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EG= (4 m) ,4 m) ) and 



m— 1 



G 



m 



2a „ 



r=l 



r=l 

m— 1 

+ E 



l r ([7o) 

lr(^o) 



G (r) 



l r (^0)l«(^r) 
1 - U(U r ) 



G (r,£) 



with C{G^) = C(G^) = C(G^) = C(G) and G< r \ G< r \ G^ l \ U r independent 
with £(U r ) = unif{0, . . . ,m — 1} for all r,l. Here, we denote li{Y) := l{ i<Y } f° r 
integer i and a random variable Y, and we have 



to + 3 



2 2^16m+ (to - l) 2 



3m + 1 



2 2^16™+ (m- l) 2 



m 


2 


3 


4 


5 


6 


7 


8 




0.754 


0.759 


0.765 


0.769 


0.774 


0.778 


0.781 




0.0938 


0.0847 


0.0782 


0.0731 


0.0689 


0.0652 


0.0619 




4.060 


4.154 


4.247 


4.336 


4.419 


4.497 


4.571 


111 


9 


10 


11 


12 


13 


14 


15 




0.785 


0.788 


0.790 


0.793 


0.795 


0.798 


0.800 


d m 


0.0590 


0.0564 


0.0541 


0.0519 


0.0499 


0.0481 


0.0464 




4.641 


4.707 


4.769 


4.829 


4.886 


4.940 


4.993 


777 


16 


17 


20 


30 


40 


50 


100 




0.802 


0.804 


0.809 


0.821 


0.830 


0.837 


0.856 


dm 


0.0448 


0.0433 


0.0394 


0.0304 


0.0247 


0.0209 


0.0117 


K m 


5.043 


5.091 


5.226 


5.596 


5.885 


6.123 


6.928 



Table 1: Numerical values of the quantities a m , d m and K m appearing in Theorem 
7.1 for various values of m. 
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